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ABSTRACT 

We investigate the possibihty of probing the large scale structure in the universe at 
large redshifts by studying fluctuations in the redshifted 1420 MHz emission from the 
neutral hydrogen (HI) at early epochs. The neutral hydrogen content of the universe 
is known from absorption studies for z < 4.5. The HI distribution is expected to be 
inhomogeneous in the gravitational instability picture and this inhomogeneity leads to 
anisotropy in the redshifted HI emission. The best hope of detecting this anisotropy is by 
using a large low-frequency interferometric instrument like the Giant Meter- Wave Radio 
Telescope (GMRT). We calculate the visibility correlation function (V5y(U)Vjy'(U)) at 
two frequencies u and u' of the redshifted HI emission for an interferometric observation. 
In particular we give numerical results for the two GMRT channels centered around 
V = 325 MHz and v = 610 MHz from density inhomogeneity and peculiar velocity of 
the HI distribution. The visibility correlation is ~ 10~^'^-10~^ Jy^. We calculate the 
signal-to-noise for detecting the correlation signal in the presence of system noise and 
show that the GMRT might detect the signal for integration times ~ 100 hrs. We argue 
that the measurement of visibility correlation allows optimal use of the uncorrelated 
nature of the system noise across baselines and frequency channels. 

Subject headings: cosmology:theory, observations, large scale structures - diffuse radia- 
tion. 



1. Introduction 



Various observations indicate that around 90% of the HI mass in the redshift range 2 to 
3.5 is in clouds which have HI column densities greater than 2 x lO^'' atoms/cm^ (Peroux et al. 
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2001, Storrie-Lombardi, McMahon, Irwin 1996, Lanzetta, Wolfe, & Turnshek 1995). These high 
column density clouds are responsible for the damped Lyman-a absorption lines observed along 
lines of sight to quasars. The flux of HI emission from individual clouds (< 10/iJy) is too weak 
to be detected by existing radio telescopes unless the image of the cloud is significantly magnified 
by an intervening cluster gravitational lens (Saini, Bharadwaj and Sethi, 2001). Although we 
may not be able to detect individual clouds, the redshifted HI emission from the distribution of 
clouds will appear as background radiation in low frequency radio observations. Bharadwaj, Nath 
and Sethi (2001; hereafter referred to as BNS) have used existing estimates of the HI density at 
z ~ 3 to infer the mean brightness temperature of ~ 1 mK at ~ 320 MHz for this radiation. 
The fluctuations in the brightness temperature of this radiation arise from fluctuations in the HI 
number density and from peculiar velocities. As shown in BNS, the cross-correlation between 
the temperature fluctuations across different frequencies and different lines of sight is related to 
the two-point correlation function (or equivalcntly the power spectrum) of density perturbations 
at the redshift where the radiation originates. The possibility of measuring this provides a new 
method for studying large scale structures at high redshifts. Estimates indicate the expected 
values of the cross-correlations in the brightness temperature to vary from lO"'^ to 10"* over 
intervals corresponding to spatial scales from 10 Mpc to 40 Mpc for some of the currently-favoured 
cosmological models. Estimates of the different contributions to the flux expected in a pixel of a 
radio image show the contribution from galactic and extragalactic sources and the system noise to 
be substantially higher than the contribution from the HI radiation. The task of devising a strategy 
for extracting the signal from the various foregrounds and noise in which it is buried is a problem 
which has still to be solved. A possible strategy based on the very distinct spectral properties of 
the foregrounds as against the HI emission is discussed in BNS. 

An alternate strategy for using the HI emission from high redshifts to study large scale struc- 
tures has been discussed by many authors (Subramanian k, Padmanabhan 1993, Kumar, Padman- 
abhan & Subramanian, Bagla, Nath, &; Padmanabhan, Bagla 1998). This is based on detecting 
the HI emission from individual protoclusters at high redshifts. There have also been observational 
efforts in this direction (see Subrahmanyan & Anantharamaiah 1990 and reference therein). No 
detections have been made till date. This strategy suffers from the disadvantage that the protoclus- 
ters correspond to very large overdense regions which are very rare events. Protoclusters with flux 
in the range 3 to 5 mJy are predicted to occur with abundances in the range 10~^ — 10""^ Mpc~^ in 
the CDM model (Subramanian and Padmanabhan, 1993). In the statistical approach proposed in 
BNS fluctuations of all magnitude in the HI distribution contribute to the signal. The statistical 
approach allows optimum use of the signal present in all the pixels of the images made at different 
frequencies across the bandwidth of a typical radio observation. In this paper we take up various 
issues related to the statistical approach originally proposed in BNS. 

The main focus of this paper is the choice of an appropriate statistical estimator to quantify 
the properties of the signal and the system noise. The statistical estimator proposed in BNS is the 
cross-correlation between the temperature fluctuations along different lines of sight in radio map 
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made at different frequencies. While this quantity is conceptually very simple, complications arise 
when it is applied to images produced by radio intcrfcromctry, as is the case with most low frequency 
radio telescopes. Such observations measure the coherence between the signals arriving at any two 
antennas, a quantity known as the visibility V^(U). This is measured for all pairs of antennas in 
the interferometric array. Here U refers to the vector joining a pair of antennas, measured in units 
of A, projected on the plane perpendicular to the direction which is being imaged. 

The image is produced by a Fourier transform of the visibility 

I^{e) = J V(U) exp[27ri 6- U] cfV (1) 

where 9 refers to different positions in the small patch of the sky which is being imaged, and 
is the specific intensity. The contribution of system noise to the signal from each antenna is 
independent, and the visibilities measured by each pair of antennas are uncorrelated. The noise in 
the pixels of a radio image constructed from the visibilities is not independent. The correlations in 
the noise in the pixels depend on the detailed distribution of the different separations (baselines) U 
for which the visibility has been measured. Any strategy based on the statistical analysis of radio 
images will be faced with the problem of distinguishing the correlations in fluctuations of the HI 
emission from the correlations in the noise. This complication can be avoided by dealing directly 
with the visibilities. In this paper we investigate the possibility of detecting the fluctuations in the 
HI emission using a statistical estimator constructed directly from the visibilities measured in a 
radio interferometric observation, without making an image. 

We next present a brief outline of this paper. In section 2 we calculate the relation between 
the statistical properties of the HI fluctuations and the visibilities produced by these fluctuations. 
In Section 3 we present numerical estimates of these quantites for some of the currently favoured 
cosmological models for the system parameters of GMRT (Swarup et al. 1990). In Section 4, we 
calculate statistical properties of the visibility correlations arising from the system noise. In Section 
5 we present the conclusions and discuss possible directions for future work. 



2. From Density Fluctuations to Visibility 

We consider radio-interferometric observations of a small patch of the sky whose center is 
in the direction of the unit vector n (Figure 1). A small patch of the sky may be treated as a 
plane, and the angle 9 which refers to different directions in the sky (Figure 1) may be treated as 
a two dimensional vector. Observations at a frequency v would measure the HI emission from a 
redshift z = (1420MHz/i^) — 1 or equivalently a comoving distance Vi,. The specific intensity of the 
redshifted HI emission arriving from any direction 9 may be decomposed into two parts 

I,{9) =h + AI,{9) , (2) 

where Ii, and AI,y{n) are the isotropic and the fluctuating components of the specific intensity. The 
isotropic component In is related to nuiiz), the mean comoving number density of HI atoms in the 
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excited state at a redshift z and we have 

- _ A21 hpcnm[z) 

where ^21 is the Einstein coefficient for the HI hyperfine transition, hp the Plank constant, c the 
speed of light and il(z) the Hubble parameter. This is a slightly rearranged version of Equation 
(11) of BNS. 

The fluctuations in the specific intensity arise from fluctuations in the HI number 

density AnHi(x) and the peculiar velocity v(x), where x refers to the comoving position x = 
r^(n + 9) The details of the calculation relating these quantities are presented in BNS, and we use 
a slightly rearranged version of Equation (12) of BNS 



AraHi(x) ^ (n- V)(n- v(x)) 



(4) 



where a is the scale factor. It should be noted that all the quantities in the right hand side of 
equation (4) should be evaluated at the epoch when the radiation was emitted. 

In this paper wc wish to calculate the contribution from the redshiftcd HI emission to the visi- 
bilities Vy{\J) that would be measured in radio- interferometric observations. The relation between 
the specific intensity and the visibilities is 



V, 



;(u) = j cPeA{e)AU{e)e-'^'''^-^ . (5) 

Only the fluctuating part of the specific intensity contributes to the visibility, and we have dropped 
the isotropic component from eq. (5). Here A(9) is the beam pattern of the individual antennas in 
the array (primary beam) . We use equations (4) and (5) to relate the visibilities to the fluctuations 
in the HI distribution. 

It is convenient to work with AHi(k), the Fourier transform of the density contrast of the HI 
number density AnHi(x)/nHi. We assume that on sufficiently large scales AHi(k) can be related to 
A(k), the density contrast of the underlying dark matter distribution, through a linear bias param- 
eter b i.e. AHi(k) = 6HiA(k). We also assume that the scales we are dealing with are sufficiently 
large that we can apply linear theory of density perturbations (Peebles 1980) to relate the pecu- 
liar velocities to the fluctuations in the dark matter distribution, v(k) = (— iai7/($7f„)k/A;)A(k), 
where /(O^) « + ^[1 — |0^(1 + 0,m)] in a spatially flat universe (Lahav et al. 1991). These 
assumptions allow us to express the fluctuations in the specific intensity as 



(2^ 



2", 

' AHi(k)e^^"'('^li+'"^-^") (6) 



where we have decomposed the wave vector k into two parts k = A;||n -|- k^ where k^^n refers to 
the component of the Fourier mode k along the line of sight to the center of the patch of sky 
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being observed, and refers to the component of k in the plane of the sky. We use equation (6) 
in equation (5) to express the visibihty V^(U) in terms of A(k). This allows us to carry out the 
integral over 9 which gives us 



V^{u,v) 



"J (27r 



AHi(k) 



1 + 



(5kj 



iri/kn 



a(U- 



27r 



(7) 



where a(U) is the Fourier transform of A{9) the primary beam, 



/ 



a(U) = / (feA{e)e 



(8) 



For a Gaussian primary beam pattern A{9) = e ^^/^o. the Fourier transform also is a Gaussian 
and we have 

a(U) = 7:9^ exp [-7:^9^U^] (9) 

which we use in the rest of this paper. 

Equation (7) relates the contribution to the visibilities from fluctuations in the HI number 
density. These fluctuations are assumed to be a Gaussian random field, or equivalently the different 
modes AHi(k) have independent, random phases. This allows us to predict all the statistical 
properties of AHi(k) in terms of the power spectrum of the fluctuations in the HI distribution 
i^i(k) which is defined as {AHj(k)AHi(k')) = (27r)^(5^(k — k')i^i(k) where () denotes ensemble 
average. We use this to calculate the correlation between the visibilities at different baselines U 
and U' and two different frequencies u and ly + Ai/. Here wc assume that the bandwidth over which 
the observations are being carried out is small compared to the central frequency i.e. Azv <^ v 
and r,^+Ai/ = Vn + r^/S.v where = Using these inputs to calculate the visibility correlation 
function we obtain 



(kx - 27rU/r^)2 + (kx - 27rU7r^)2 



X exp 



(2/r.^o)2 



(10) 



The assumption of linear bias allows us to relate PHi(k) to -P(k) the power spectrum of density 
fluctuations in the dark matter distribution through the linear bias parameter Pmi)^) = 6^-P(k). 
We use this in later sections to obtain numerical estimates for different cosmological models. 

We next turn our attention to a qualitative analysis of equation (10) to determine the nature 
and extent of the correlations between the visibilities measured at different baselines. This is 

(kx -2Tv\J/r^f + (k J, -27rU' /r^f 



largely governed by term exp 



which arises because the observations 



(2/r,0o)^ 

have a limited sky coverage determined by the primary beam pattern. This term is very small 
for all values of kj^ unless | U — U' |< 1/9q. The parameter is related to the the FWHM of 
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the primary beam and Oq ^ O.G^fwhm, which allows us to relate 0o to D the diameter of the 
individual antennas as 0q ~ X/D. We also express U and U' in terms of d and d', the physical 
separations between the pairs of antennas as U = d/A and U' = d'/A' It should be noted that 
here and throughout we assume that A' = A(l — Av/v) with Ai/ « v. Using these we see that the 
condition for the visibilities to be correlated can be expressed as | d — d' |< D. This implies that 
the visibilities measured by a pair of antennas separated by the displacement d will be correlated 
to the visibilities measured by another pair separated by a displacement d' only if difference in 
the two displacements d and d is less than the antenna diameter. The consequences of this for a 
typical antenna configuration are 

(a) The visibilities measured at various frequencies by the same pair of antennas are correlated. 

(b) The visibilities measured by different pairs of antennas are uncorrelated. 



For the rest of the paper we shall consider only the correlation between the visibilities measured at 
various frequencies by the same pair of antennas. We use the notation (V^(U)V^*^_^j^(U)) to denote 
the correlation between the visibilities measured at two different frequencies by the pair of antennas 
at a physical separation d = c\J /u. The fact that this physical separation d will correspond to a 
different baseline \j' = {v + Au)d/c at the frequency v + Av is ignored throughout as Au <C v. 
Equation (10) can now be used to obtain 



(v;(u)y;+^,(u)) 



X exp 



(kj 



(^^Hi(fc)e 
- 2^U/r^)2 



1 + /3 



2(l/r,0o) 



(11) 



The role of the Gaussian in equation (11) arising from the primary beam pattern is to ensure that 
most of the contribution is from Fourier modes for which ~ (27r/r,y)U. Equation (11) is further 
simplified if we approximate the Gaussian with a Dirac Delta function 



exp 



(kx - 27rU/r^ 



2(l/r,0o) 



27r 



5' 



27r, 



U 



(12) 



whereby only Fourier modes for which kj 
integrals in equation (11) giving us 

(K(u)y;+A.(u)) 



(27r/riy)U contribute. This allows us to do two of the 



21 2 

oj 



1+P 



/ 

Jo 



2 



with k = Jkl + (27r/ri.)2[/2 



(13) 



Equations (11) and (13) represent the main results of this section. They relate the correlation in the 
visibilities to the power spectrum of fluctuations in the HI distribution. The visibility correlations 
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at any baseline U are seen to be sensitive to Fourier modes > 2TTU/ry. It comes from the fact that 
each visibiHty measurement is sensitive to one Fourier mode in the plane of the sky, which arises 
from the projection of three-dimensional Fourier modes making different angles with the plane of 
the sky. The typical length scales ~ tt/A; that contribute to the measurement for any baseline U 
arc < 30/1-1 Mpc(100/C/). 

In the next section we use these equations to make predictions for the visibility correlations 
expected in the currently favoured cosmological models and we discuss the possibility of observing 
these. 



3. Results 

Equation (3) can be calculated to give: 

for a spatially fiat cosmological model. Here {1 + z) = 1420/z/, v being the observed frequency. 
We use ^gas{z) = 10~^ as a fiducial value throughout for 2; > 1 (Peroux et al. 2001). We give 
results for the currently-favoured cosmological model: spatially-flat with f^AO = 0-7 and = 0-3 
(Perlmutter et al. 1999, de Bernardis et al. 2000). We use h = 0.7 whenever quoting a numerical 
value (Freedman et al. 2001). 

For GMRT 6'fwhm = 1-8° x (325MHz/zy). Wc plot the visibility correlation function in Fig- 
ures 2 and 3 for the GMRT channels centered around = 325 MHz and = 610MHz for COBE- 
normalized power spectrum (Bunn & White 1996) and 6 = 1. GMRT has a total bandwidth of 
16 MHz at these frequencies in 128 channels. The visibility correlation function shown in the figures 
is averaged over one of the these channels with Av = 1.25 kHz. The correlation function is not very 
sensitive to the width of the channel so long as the channel width < a few kHz. 

For u = v', the signal (V(K(U)K'(U))) is 10-50 ^liy for baselines |U| ~ 100-1000. GMRT 
has 15 antennas in a central array within a radius of 2± 1km. Antenna pairs formed from these 
antennas will be most sensitive to the signal. 

For V 7^ f' , the correlation signal is seen to dip sharply as frequency separation is increased. 
The signal is anti-corrclatcd and drops below c± 1 /xJy for v' — u >2 MHz. For v' — v < 0.5 MHz 
the signal is 5-30 /iJy for baselines < 500. 

In Figure 2 and 3 we assume that the HI distribution follows the underlying dark matter 
distribution, .i.e. 6=1. However this may not be true; observed structures at any redshift are 
expected to be biased with respect to the underlying mass distribution (see e.g. Bardeen et al. 
1986). This bias is expected to be higher at larger redshifts and the observed strong clustering 
of high redshift galaxies is at least partly owing to this fact (Steidel et al. 1998). In this paper 
we adopt a simple model of bias and assume it to be linear and independent of the Fourier mode. 
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Though the exact dependence of the HI signal on the bias is comphcated (Eq. (13), the signal 
scales roughly linearly with bias. This means that for a moderately biased HI distribution b < 2 
the signal could be higher by a factor of two. 



4. Noise 

For each visibility measurement in the UV plane, the contribution comes from both the signal 
from HI fluctuations S^, the detector noise N^,, and various galactic and extragalactic foregrounds. 
We consider here the contribution from only the HI signal and the noise. The visibility measurement 
gives: 

F,(U) = 5,(U) + 7V,(U) (15) 

Both S and N are complex quantities with both real and imaginary parts distributed as Gaussian 

random variables (see e.g. Crane & Napier 1989 for properties of noise). The signal is a Gaussian 
random field because it is linear in density perturbation A (A;) which is expected to be a Gaussian 
random field for large scales (small k) (see e.g. Peacock 1999, Bardccn et al. 1986). The signal and 
noise are uncorrelated with each other. The reality condition of the surface brightness (Eq. 1) and 
the noise in the real space implies S'(— U) = S'*(U) and A'"(— U) = A'"*(U) Our aim is to construct 
bilinear combinations like V^(U)V^*(U) and to detect S',/(U)S'*,(U). v and u' will in general be 
different. The average signal (S',/(U)S'*,(U)) is calculated in the previous section. The average 
noise correlation, for v = v': 



(iv4u)iv:(u)) 



rp ~| 2 



(16) 



Here Tgyg is the system temperature, Az/ is the bandwidth, K is the antenna gain, and At is the 
time of integration for one visibility measurement. For v ^ v' the noise correlation vanishes as the 
noise in different frequency channels is uncorrelated. 

Estimator of the signal: From the measured visibility it is possible to write several estimators 
of the signal. The simplest such estimator is: 

s = K(u)y;,(u) - (iv,(u)iv;(u)) (17) 

This estimator is clearly unbiased, i.e. 

(5) = {S4V)s*AV)). 

The quantity of interest to us is the variance of the estimated signal: <t{S)'^ = (5^) — {S}'^. This 
quantity is calculated to be (see Appendix A for a derivation) : 

a2(5)~^(iV,(U)iV;(U))2 (18) 
n 

q = 2 foT V = u' and q = 1 for u ^ v'. n is the total number of visibility measurements. The 
signal-to-noise for the detection of the HI fluctuation signal is {S)/a{S). 
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The value of n for a given |U| in general depends on the antenna positions, frequency coverage, 
and the position of the source in the sky. To calculate the value of n we first consider the case 
when V = v' . 

Case I- u = v': To get a simple estimate, assume that a pair of antennas describes circular 
tracks in the UV plane with radius |U| and that these tracks do not overlap with the tracks of other 
antenna pairs, then n = T/At for each frequency channel, where T is the total time of observation. 
(The actual observation is more complicated because the antenna tracks cross in the UV plane.) 
For this case, Eq. (18) gives, using Eq. (16): 

a\S) = 

Before proceeding further it is useful to calculate the expected noise for the GMRT. We shall take the 
fiducial observing frequency to be ~ 320 MHz. At this frequency the GMRT system temperature 
Ts — 110 K. GMRT has a total bandwidth of 16 MHz at this frequency in 128 channels. To 
calculate the quantity in Eq. (16) we take one channel {Av = 125 kHz) and an integration time 
of At = 30 sec for one 'instantaneous' measurement for a given baseline. The antenna gain at 
this frequency K = 0.32 KJy~^. Using this Eq. (16) gives the noise correlation to be 2± 175 mJy. 
As this is much larger than the expected signal calculated in the last section, we are justified in 
neglecting the signal term in Eq. (26) in the appendix. For total time of integration T = lOhrs, 
Eq. (19) gives <t{S) ~ 5mJy. 

As each frequency channel gives a realization of the signal, the noise can be reduced further 
by using all the frequency channels. This gives n = A^chan x T/At with Achan = 128 for GMRT; 
Eq. (19) gives a{S) ~ 0.45 mJy. Even though we made a few simplifying assumptions in calculating 
the noise, this is the typical value obtainable in a real experiment. The expected noise is much 
larger than the expected signal from the HI fluctuations and therefore an experiment like GMRT 
cannot detect the HI fluctuation for a given |U| or by using a single pair of antennas in a reasonable 
amount of integration time. 

To reduce the noise further one must consider averaging the signal over more than one pair of 
antennas. One such estimator is the map RMS which uses information from all possible baselines. 
The total number of 'instantaneous' baselines for an experiment with A antennas is A(A — l)/2. 
This gives n = Nc\ianN{N — l)/2 x T/At and gives a further decrease of a factor ~ A/\/2 in the 
sensitivity. In the previous section we showed that much of the contribution to signal comes from 
baselines < lOOOA. GMRT has 15 antennas in the central array within a radius of ~ 1 km. Much 
of the contribution to the signal will come from these antenna pairs. 

Taking A^ = 15 in the calculation of noise sensitivity, we get £7(5) ~ 40/xJy for 10 hours 
of integration. The average signal (averaged over 15 antennas of the central GMRT array) is 
~ 20-40 ^Jy. This means that a few sigma detection of the signal might be feasible in integration 
time < 100 hrs using the central array of GMRT. 

Case II- V 7^ v': In this case, (A,^(U)Ar*(U)) = and the variance of the signal estimator 



2T 



sys 



K^AvT 



(19) 
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is smaller by a factor of 2 (Eq. (18)). The rest of the calculations proceeds similar to the first 
case. The number of distinct pairs for two different frequencies will depend on the separation of 
the frequencies. However, one must also take into account the line width of the damped Lyman-a 
clouds which is ~ 200kmsec~^ (Prochaska & Wolfe 1998). The line width of each GMRT channel 
is ~ 120kmsec'"^. This means that one damped Lyman-a cloud will spill over in many frequency 
channels, thereby creating a correlation in the signal for nearby channels. This correlation must be 
accounted for before the HI fluctuation signal can be extracted. It is hard to do it analytically and 
this issue will be addressed in future using simulations of the HI signal. However it is possible to 
get the typical noise sensitivity for this measurement. 

The total number of frequency channel pairs is -/Vchan(-^chan — l)/2- Each pair gives a different 
realization of noise. If we average the signal over all baselines and all frequency pairs, we get 
n = N{N — l)/2iVchan(^chan " l)/2- This is larger than the maximum value of n in the first case 
by a factor of ~ A^'chan/2. However it would be meaningful to average over all channel-pairs if the 
signal is significant for all such cross-correlations. As seen in the previous section, the correlation 
between different channels falls rapidly for separation > 1 MHz, therefore the number of useful 
channels pairs is less than the maximum possible. The value of n however is still likely to be more 
than in the previous case. For example if we average the signal over all the frequency channels with 
f' — V < 0.5 MHz, the expected signal is ~ 10-20 //Jy. The number of frequency pairs are ~ SA'^chan 
in this case. This gives cr(S') ~ 15 /xJy for ten hours of integration. From this discussion we can 
conclude that it might also be possible to extract this signal for integration time < 100 hrs using 
GMRT. 

The noise in detecting the HI signal cf{S) is comparable to the sensitivity for detecting con- 
tinuum sources. This is so even though the HI clouds emit line radiation. Therefore the method of 

observing fluctuations in HI radiation makes more optimal use of all the frequency width available 
in the experiment. The individual clouds are very faint (flux < 10 ^Jy (Saini et al. 2001)) and 
cannot be detected using GMRT because the line sensitivity ~ 50 //Jy for 100 hrs of integration 
needed to detect the HI fluctuations. 

5. Conclusions and Discussion 

Our main results are: 

1. The correlation in measured visibilities owing to density inhomogeneities and peculiar veloc- 
ities of the HI distribution at high redshifts can be related to the power spectrum of the HI 
distribution (Eq. (13)). The visibility correlation for any baseline U is sensitive to Fourier 
modes > 27111 /vi,. This means that the typical length scales probed for any baseline are 
< 30/1-1 Mpc(100/C/). 

2. The signal is strongest for baselines < 1000 and for u = v\ i.e. on the same two-dimesional 
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map, the correlation is between 2 x 10 ^ Jy^ and 10 ^'^ Jy^. 

3. For V 7^ z/', i.e., cross-correlation signal, the correlation signal is lO^^-lO"^-*^ Jy^ for baselines 
|U| < 500 for y' — y ^ 0.5 MHz. The correlation is negative for most baselines for v' — v >^ 
2 MHz and falls below 10-^^ jy2 

4. GMRT might detect these signals for integration times < 100 hrs. We argue that measur- 
ing visibility correlations in the presence of system noise makes optimal use of the fact that 
the noise is uncorrelated across baselines and frequency channels. The error for these mea- 
surements is comparable to and can even be smaller than the continuum sensitivity of the 
instrument. 

The signal and noise analyses given in this paper are for the system parameters of the currently- 
operational GMRT. However it can be easily extended to future telescopes like Square Kilometer 
Array (SKA) ^ and Low Frequency Array (LOFAR) ^. Our analysis can also be extended to higher 
redshifts (z ~ 5) as the HI content of the universe at these redshifts is beginning to be known (see 
e.g. Peroux et al. 2001). 

In this paper we neglected two other contributions to the visibility correlations: galactic and 
extra-galactic foregrounds and the Poisson fluctuations owing to point-like nature of HI clouds. The 
galactic foregrounds are expected to be dominated by the fluctuations in the synchrotron radiation 
from the Galaxy. The only existing all-sky map at low radio frequencies is the 408 MHz Haslam 
map (Haslam et al. 1982). This map has an angular resolution of 2± 1° and therefore cannot give 
much information on the angular scales of interest to us. The extra-galactic foregrounds get most 
of its contribution from the radio point-sources. Little is know about the radio point sources at 
sensitivity levels (< 100 ^Jy) and frequencies of relevance in this paper. However it seems likely 
that these foregrounds will dominate the HI signal (BNS 2001), and a possible strategy to remove 
foregrounds was discussed in BNS (2001). This issue will be discussed in a later paper by using 
simulations of the HI signal and the foregrounds. 

The HI at large redshift is locked up in discrete clouds. This will give rise to visibility correla- 
tions even in the absence of gravitational instability. This signal also depends on the mass function 
of the HI clouds (Saini et al. 2001) and therefore the detection of this signal can give important 
clue about how the HI at high redshift is distributed. We shall attempt to estimate this signal from 
simulation of the high redshift HI in a later publication. 



^soo http://www.nfra.nl/skai/ 
"^see http://www.astron.nl/lofar 
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Appendix A 

The HI fluctuation signal and the noise satisfy the following conditions: 

{S,{\J)S,,(U')) = (5,(U)5,,(U))<5d(U-U') (20) 
{N,{U)N,,{V')) = {N,{U)N,{U))6d{V-V')Sd{i^-i^') (21) 

Note that the signal is correlated across frequency channels while the noise is not. Let us assume 
that any interferometric experiment makes n measurements of the visibility for given |U| and the 
quantities (like visibility correlation function) are estimated by averaging over these measurement. 
Using this it is seen that the signal (for u = i/') and the noise can be treated as n uncorrelated, 
random variables with the same mean and variance. In such a case, the estimated average equals 
the average of any of the random numbers and the variance of the estimated 'signal' is 1 /n times 
the variance of any of the random variable (sec e.g. Papoulis 1965; this result is independent of the 
probability distribution functions of the individual variables). Of particular interest to us is the 
variance of the estimator in Eq. (17) for a given The variance is estimated from n realizations 
of the random variable S. It is given by: 

a\S) = ^ (22) 

Here o"^ = (S*^) — (S*)^ is the variance of any realization of S. 

It is given by, using the definitions of V and N from Eq. (15): 

= (KK'KK') + {N^.N^'f - 2{N^N^,) (KK') - {S^S^^'f (23) 

To simplify this expression further we use the fact that for a Gaussian random process, the expec- 
tation value of four random numbers is given by 

{X1X2X3X4) = {XIX2){X^X^) + {xiXz){x2X^) + {x2Xi){xiXz) 

We first consider the case when v = v' . Eq. (23) then reduces to: 

= 3(KK')^ + {N^N^,)^ - 2{N^N^,){V^V^,) - {S^S^.f (24) 



Again using Eq. (15) for the definition of (V,/V^), this simplifies to: 

= 2 X {{S,S,) + {N,N,)f 



(25) 
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The case when u = i/' is slightly more complicated. Making a simplifying assumption that the signal 
contribution can be dropped while calculating the four-point functions in Eq. (23) (for justification 
see the text), we get: 

~ {N^N^f (26) 
Eqs. (25) and (26) along with Eq. (22) gives Eq. (18). 
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Observer 

Fig. 1. — The geometry for the flat-sky approximation is shown. 




Fig. 2. — For the central frequency u = 320 MHz, this figure shows the visibihty correlation function 
as baseline U varies. The thick curve shows the visibility correlation for v = v' . The other three 
curves show, from top to bottom, the visibility correlation for v' — v = {0.5, 1, 2} MHz, respectively. 
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Fig. 3. — Same as Figure 2 for the central frequency v = 610 MHz 



